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I. AMENDMENT 

Amendment of the Specification 

The paragraph beginning on page 8 line 1 is re- written as follows: 

Fig. 5: Amino acid sequence comparison of the binding protein and the inhibitor of 
human hepatocvte growth factor activator (HAI-1) . Twelve- amino acid (GPPPAPPGLPAG) 
rSEO ID NO: 2) and seven-amino acid (TQGFGGS) (SEP ID NO: 3) sequences of die amino 
termini obtained from the 40-kDa binding protein doublet and the 25-kDa binding protein, 
respectively, and were identical to amino acids 36-47 and 154-160 of HAl-I (SEQ ID NO; 
H. In addition, 12 unique peptides from the cryptic digest of the larger band of the 40-kDa 
binding protein doublet were compared with HAM by MALDI-MS. These peptides covered 
87 residues that spanned positions 135-310, or 17% of the entire HAI-1. The two stretches of 
ammo-terminal protein sequences are double-underlined, and those 12 peptides identified by 
MALDI-MS, including residues 135-143, 154-164, 165-172, 173-182, 173-190, 183-190, 
194-199, 203-214, 204-214, 288-301, and 302-310 fSEOIDNO: H are underlined. 

The paragraph beginning on page 10 line 10 is re-written as follows: 

Fig. 9: The nucleotide and deduced amino acid sequen ces fSEO ID NO: r r 311 4) of a 
matriptase cDNA clone . The primers (20 bases at the 5' -end and 18 bases at the 3'- end) 
used for reverse transcriptase-polymerase chain reaction are underlined. Thirty- three bases . 
beyond the 5 '-end primer and 92 bases beyond the 3' -end primer were taken from SNCl 9 
cDNA and incorporated. The cDNA sequence (SEQ ID NO: [[1]] 5) was translated from the 
fifth ATG codon in the open reading frame. Nucleotide and amino acid numbers are shown 
on the left. Sequences that agreed with the internal sequences obtained from matriptase are 
double-underlined. His-1-84, Asp-539, and Ser-633 are boxed and indicated the putative 
catalytic triad of matriptase. Potential N-g)ycosylation sites aft are indicated (A). An RGD 
sequence is indicated (£), 



30493407 vl 

PAGE 4/26* RCVD AT 11/23/2004 10:13:54 AM [Eastern Standard Time] 1 SVR:USPT0-EFXRR'2 * DNES:8729306 * CSID:703 905 2500 * DURATION (mm-ss):0742 



23.-H-04 10:09 F rora-P I LLSBURY WINTHROP 



703-805-2500 



T-852 P005/0Z6 F-743 



Application No. 09/936,333 

Election and Preliminary Amendment dau:d Nov. 23, 2004 
Id Reply lo Restriction Requirement dated June 3, 2004 
Attorney Ref. No.: 082137-0280655 

The paragraph beginning on page 10 line 21 is re-written as follows: 

Fig. 10; Comparison of the amino acid sequence of the C-terminal region of 
matriptase with trypsin, chymo trypsin, and with the catalytic domains of other serine 
proteases. The C-terminal region (amino acids 431-683) (SEP ID NO: 6) of matriptase is 
compared with human trypsin (SEP ID NO: 13) (Emi et al„ Gene (Amst.) 41 : 305-10 
(1986)); human chymotrypsin (SEP TP NO: 140 (Tomita etal, Biochem. Biophys. Res. 
Commun. 158: 569-75 (1989)); the catalytic chains of human enteropeptidase CSEP ID NO: 
7} (Kitamoto et al> Proc. Natl Acad. Sci. USA 91: 7588-92 (1994)), human hepsin (SEP ID 
NQ; JO) (Leytus et al, Biochemistry 27: 1067-74 (1988)), human blood coagulation factor XI 
f SEP ID NO: 1 1 ) (Fujikawa et al, Biochemistry 25: 2417-24 (1986)), and human 
plasminogen (SEP ID NP: 12) : and the serine protease domains of two transmembrane 
serine proteases, human TMPRSS2 (SEP TP NP: 8) (PaoJoni-Giacobino et al, Genomics 44: 
309-20 (3997)) and the Drosophila Stubble-stubbloid gene (Sb-sbd) (SJEO ID NP: 9) (Appel 
etal, Proc. Natl Acad. Sci. USA 90: 4937-41 (1993)). Gaps to maximize homologies are 
indicated by dashes. Residues in the catalytic triads (matriptase His-484., Asp-539, and Ser- 
633) are boxed and indicated (A). The conserved activation motif ((R/K)VIGG) (SEP ID 
NP: 34) is boxed, and the proteolytic activation site is indicated. Eight conserved cysteines 
needed to form four intramolecular disulfide bonds are boxed, and the likely pairings are a$ 
follows: Cys-469-Cys-485, Cys-604-Cys-618, Cys-629-Cys-658, and Cys-432-Cys-559. The 
disulfide bond Cys-432-Cys-559: The disulfide bond Cys-432-Cys-559 is observed in two- 
chain serine proteases, but not in trypsin and chymotrypsin. Residues in the substrate pocket 
(Asp-627, Gly-655, and Gly-665) arc boxed and indicated (*). It is evident that the residue 
positioned at the bottom of the substrate pocket is Asp in trypsin-like proteases, including 
matriptase, but Ser in chymotrypsin. 

The paragraph beginning on page 1 1 line 19 is re-written as follows; 

Fig. 11: Alignment of partial s equences of the noncatalvtic domain with those of 
homologous regions in other proteins . A, the cysteine-rich repeats of matriptase (amino acids 
280-314, 315-351, 352-387, and 394-430) [SEP ID NP: 15) are compared with the 
consensus sequences of the human LDL. receptor (SEP TP NP: 16) (Sudhof et al. Science 
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267: 8544-57 (1992)), and « CiMOO J , clr/s , type sequenC es 
„„„ 1 , « as r 1989)) The consensus sequences are boxed. *. »- 

783-87 (1986)), CI. (amino acids )75-283) ^S*^!— L Ra , leact ive 

«-7 <•* MQR7Y andTosieroi., Biochemistry 26. 8516-^U^ ;>> 
Kad** 169: 547-53 (1987 fi , ^ Bioc , em . 

factor (JRaRF) (ammo acids 185-290) lisfiLiii! »* 

icM- 1003-9 (1993): and Sato « at. Im. Immunol. 6. 665 9 
jta. Commurt. 196: 1003-9 V»J». (§ EQffiJ3£a& ( Kin ° sluta 

calcium dependent serine protease (CSP) (amino acids 181-289) £§SU^ 
« r^n. 250- 41 1-5 (1989)). Tbe consensus sequences are boxed. 
Theparagraph beginning onpa g e 13 line 15 is rewritten as foUows: 

the paragraph beginning on page 13 line 22 is rewritten as follows: 

L Bitt l .Chem .,274. 18231 ftU^M ac SEO rn NO - ([3]] 5 U a truncated matxiptasc 

GenBat* Accession No. AAD42765. 
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The paragraph beginning or, page 14 Une 17 is re-written as folios: 
Another object of the invent is ,o provide nucleic acid molecules (SEQ ID NOS^ 
U l]]4«,d [[2]] 26) encoding matriptase proteins or polypeptide fragments thereof (SEQ ID 
NOS: [[3]] Sand [[4]] 27). 

The paragraph beginning on page 14 line 20 is re-written as follows: 
It is a further object of the invention to provide an antibody or antibodies which 
recognizes and binds to SEQ ID NO: [[3]] £ or a fragment thereof, SEQ ID NO: & 
fraglt thereof, ro a single-chain (zymogen) form of matrix or „*^<£** 
frcTof mauiptase. Preferred antibodies „ monoclonal antibodies and fragments thereof as 
well as chimeric, humanized or human antibodies. 

The paragraph beginning on page 17 line 21 is re-written as follows: 
By "matriptase" is meant a rrypsin-HUe protem. with a molecular weight of between 
72-U>a and 92-kDa and is related to SEQ ID NO: [14]] 27 or is a fragment .hereof. It can 
inc iude both single-chain and doub.e-chain forms of the protein. 

(inactive form) of matriptase ,s a single-chain protein. The two-cham form of matnptase 
Lactiveform of matriptase, which possesses catalytic activity. B om forms of ma^ptase 
„ found to some extent in mi* and cancer cells because extracellular matnx C^CM 
modeling is necessary to both normal and pathologic remodehng p~ 
displays al, Known forms of matriptase. Both cancer cells and mil, contam the deferent 
fo L of matriptase. However, in mi* the dominant form is the activated form of matnptase 
which then binds to HAM . 

The paragraph beginning on page 22 line 9 is re-written as follows: 
The present invention further provides nucleic acid mo.ecules that encode the protein 
having SEQ ID NO: [[3]] 5 or SEQ ID NO: [[4]3 2i or fragments thereof , and related 
proteins, which are preferabiy in isolated form. As used herein, "nucle.c acd" . defi ed » 
^A or DNA that encodes a peptide as defined above, or is complementary to nucletc actd 
S eo,ence encode such peptides, orhybridizes to such nucleic acid and remains stably 
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bound to it under appropriate stringency conditions, or encodes a polypeptide sharing at least 
75% sequence identity, preferably at least 80%, and more preferably at least 85%, with the 
peptide sequences. Specifically contemplated are genomic DNA T cDNA, mRNA and 
amisense molecules, as well as nucleic acids based on alternative backbone or including 
alternative bases whether derived from natural sources or synthesized. 

The paragraph beginning on page 24 line 14 is re-written as follows: 

As described above, the identification of the human nucleic acid molecule having 
SEQ ID NO: I[1J] 4 or SEQ ID NO: [[2]] 2£ allows a skilled artisan to isolate nucleic acid 
molecules that encode other members of the matriptase family, in addition to the human 
sequence herein described. Further, the presendy disclosed nucleic acid molecules allow a 
skilled artisan to isolate nucleic acid molecules that encode other members of the matriptase 
family of proteins in addition to the disclosed protein having SEQ ED NO: [[3]] 5 and SEQ 
JD NO: [[4]] 27. 

The paragraph beginning on page 24 line 21 is re- written as follows: 

Essentially, a skilled artisan can readily use the amino acid sequence of NO: [[3]] 5 or 
SEQ ID NO: [[4]] 27 to generate antibody probes to screen expression libraries prepared 
from appropriate cells- Typically, polyclonal antiserum from mammals, such as rabbits, 
immunized with the purified protein (as described below) or monoclonal antibodies can be 
used to probe a mammalian cDNA or genomic expression library, such as Xgtll library, to 
obtain the appropriate coding sequence for other members of the protein family. The cloned 
cDNA sequence can be expressed as a fusion protein, expressed directly using its own control 
sequences, or expressed by constructions using control sequences appropriate to the panicular 
host used for the expression of the enzyme. 

The paragraph beginning on page 29 line 9 is re-written as follows: 

First, a nucleic acid molecule is obtained that encodes a protein of the invention, such 
as the nucleic acid molecule depicted in SEQ ID NOS: [[1]] 4 or [[2]] 26, or particularly for 
the matriptase nucleotides encoding for example, the serine protease catalytic domain in the 
carboxy terminus of the matriptase protein or the LDJL domain. The coding sequence is 

6 

304934O7v 1 

PAGE 8/26 * RCVD AT 11/23/2004 10:13:54 AM [Eastern Standard Time] * SVR:USPT0-EFXRF-1/2 * DNIS:8729306 * CSID:703 905 2500 * DURATION (mm-ss):Q742 



23-11-04 10:11 Frra-PIILSBURY KINTHROP 



703-905-Z500 



T-852 P 009/026 F-T43 



Application No. 09/936,333 

Election ant! Preliminary Amendment dated Nov «.2W4 
In Reply to Resifiction Requirement dated June 3. 
Attorney Ref. No.: 082 1 37-0280655 

directly suitaUe for expression in any host, as it is not interrupted by introns. The sequence 
can be transfected into host cells such as eukaryotic cel.s or prokaryotic cells. Eukaryonc 
hosts include mammalian cells (** , HEK293 cells, CHO cells and PAE-PDGF-R ceHs), as 
well as insect cells such as Sf9 cells using recombinant baculovixus. Alternatively, fragments 
encoding only portion of matriptase can be expressed alone or as a fusion protem. For 
example, the C-terminus of matriptase containing the serine protease domain can be 
expressed in bacteria as a GST- or His-tag fusion protein. These fusion protects can then 
purified and used to generate polyclonal antibodies. 

The paragraph beginning on page 30 line 16 is re-written as follows: 

Another embodiment of the present invention provides methods for use in isolating 
and identifying binding partners of matriptase protein, In detail, a protein of me invention ts 
n^edwithapotential binding partner or an extract or fraction of. cell under conditions that 
allow the association of potential binding partner* with the protein of the invention. After 
mixing, peptides, polypeptides, proteins or other molecules that have become associated «nh 
a protein of the invention are separated from the mixture. The binding partner that bound to 
the protein of the invention can then be removed and further analyzed. To identify and 
isolate a binding partner, the entire protein, for instance the entire disclosed protein of SEQ 
ID NO: [[3]] £or SEQ ID NO: [[4]] Zl can be used. Alternatively, a fragment of the protem 
can be used. 

The paragraph beginning on page 33 line 3 is re-written as follows: 
Another embodiment of the present invention provides methods for identifying agents 
that modulate the expression of a nucleic acid encoding a protein of the invention, such as a 
protein having the amino acid sequence of SEQ ID NO: [[3]] 5 or SEQ ID NO: [[4]] 21- 
Such assays may utilize any available means of monitoring for changes in the expression 
level of the nucleic acids of the invention. As used herein, an agent is said to modulate the 
expression of a nucleic acid of the invention, for instance a nucleic acid encoding the protetn 
having the sequence of SEQ ID NO: [[3]} 5 or SEQ ID NO: [[4]] 22, if it is capable of up- or 
down-regulating expression of the nucleic acid in a cell. 
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The paragraph beginning on page 33 line 12 is re-writtcn as follows: 

to one assay format. cell lines tha/contain reporter gene fusions between the open 

Itner may be prepared. Numerous arable fusion panne, are known and read ly 
avTLe inclulg the firefly luciferase gene and the gene encoding cNorampherucol 
Z 7— ^- - a,, W Biochem. I* *«♦ « CeU lines conta,,^ 

Different 

^ control samples identic agents which modulate the expression of anucletc a«d 
encoding the protein having the sequence of SEQ ID NO: [13)] 5 or SEQ ID NO. [[4J] 27 or 
retated proteins. 

T*e paragraph beginning on page 33 line 23 is re-written as follows: 
Additional assay formats may be used to monitor the ability of the agent to modulate 
tte expression of a nucleic acid encoding a protein of the invention such as the protem havtng 
SEQ ID NO: [[3]] 5 or SEQ ID NO: [[4]] 27. For instance, mRNA express^ may be 
Stored directly hybridization to the nucleic acids of the — . CeU ^esare 
exposed to the agent to be tested under appropriate conditions and ume and total RNA 
n^AisisolatedbystandardproceduressuchthosedisclosedinSambrooK,^. 

(Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring ^'^"^^^'j^'^ 
Press 1989) Probes to detect differences in *HA expression .eve.s between cells exposed to 
me agent and control cells may be prepared from the nucleic acids of the invention. It ,s 

under conditions of high stringency. Only high.y commentary nucleic acd hybnds form 
und er conditions of high stringency. Accordingly, the stringency of the assay cond.uons 
determines the amount of complementarity which should exist between two nucietc^d 
strands in order to forrr, a hybrid. Stringency should be chosen to maxmuzc the d.fference 
.tabiUtybetweentheprobe^gethybridandpotentialprobe-.non-targethybnds. 
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The paragraph beginning on page 34 line 20 is re- written as follows: 

Hybridization conditions are modified using known methods, such as those described 
by Sambrook et al. (] 989) and Ausubel et at. (1995), as required for each probe. 
Hybridization of total cellular RNA or RNA enriched for polyA RNA can be accomplished in 
any available format. For instance, total cellular RNA or RNA enriched for polyA RNA can 
be affixed to a solid support, and the solid support exposed to at least one probe comprising at 
least one, or part of one of the sequences of the invention under conditions in which the probe 
will specifically hybridize. Alternatively, nucleic acid fragments comprising at least one, or 
part of one of the sequences of the invention can be affixed to a solid support, such as a 
porous glass wafer. The glass wafer can then be exposed to total cellular RNA or polyA 
RNA from a sample under conditions in which the affixed sequences will specifically 
hybridize. Such glass wafers and hybridization methods are widely available, for example, 
those disclosed by Beanie (WO 95/1 1755). By examining for the ability of a given probe to 
specifically hybridize to an RNA sample from an untreated cell population and from a cell 
population exposed to the agent, agents which up or down regulate the expression of a 
nucleic acid encoding the protein having the sequence of SEQ ID NO: [[3]] 5 or SEQ ID NO: 
[[4]] 22 are identified. 

The paragraph beginning on page 35 line 13 is re-written as follows: 

Another embodiment of the present invention provides methods for identifying agents 
that modulate at least one activity of a protein of the invention, such as the protein having the 
amino acid sequence of SEQ ID NO: [[3]] 5 or SEQ ID NO: [[4]] 22- Such methods or 
assays may utilize any means of monitoring or detecting the desired activity. 

The paragraph beginning on page 37 line 1 10 is re-written as follows: 

While the polyclonal antisera produced in this way may be satisfactory for some 
applications, for pharmaceutical compositions, use of monoclonal preparations is preferred. 
Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared 
using the standard method of KohJer et al, {Nature 256: 495-7 (1975)) or modifications 
which effect immortalization of lymphocytes or spleen cells, as is generally known. The 
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immortalized cell lines secreting the desired antibodies are screened by immunoassay in 
which the antigen is the peptide hapten, polypeptide or protein. When the appropriate 
immortalized cell culture secreting the desired antibody is identified, the cells can be cultured 
either in vitro or by production in vivo via ascites fluid. Of particular interest, are monoclonal 
antibodies which recognize the catalytic domain of matriptase (e.g. , amino acids 432-683 of 
SEQ ID NO: [[3]] 5). 

The paragraph beginning on page 53 Jine 3 is re-written as follows: 

Ex pression of HAI-1 in COS^7 cell : To verify that HAM encodes the binding protein 
of matriptase, we isolated an HAM cDNA fragment by reverse transcriptase-polymerase 
chain reaction (RT-PCR) utilizing mRNA from MTSV 1.1B immortaUzed human luminal 
mammary epithelial cells. Primer sequences for HAM (S-GGCCCGCGCTCTGAAGOTGA 
-3' f^HTDNQ: 28^ and 5'-TTGGCAAGCAGGAAGCAGGG-3*) fSEQIDNO: 29) were 
derived from the published sequence. Standard RT-PCR was carried out using the Advantage 
RT-PCR kit (Clontech; Palo Alto, CA), and the product was ligated into pCR2.1 (Invitrogen; 
Carlsbad, CA) by TA cloning. The sequence of the RT-PCR product was obtained by 
standard methods, and confirmed with the published HAM sequence (Miyazawa et aL J- 
Biol Chem. 268: 10024-8 (1993)). An eukaryotic expression vector was constructed 
(pcDNA/HAM), utilizing the commercially available pcDNA3.1 vector (lavitrogen; San 
Diego, CA). A 1.6 kb EcoRI fragment containing the HAM cDNA was cloned into the 
EcoRI site of pcDNA 3.1 . This construct contains the open reading frame (ORF) of HAM 
driven by a CMV promoter. Correct insertion of the HAM cDNA was verified by restriction 
mapping. Transfections were performed using SuperFeci transfection reagent (QJAGEN; 
Valencia, CA) as specified in manufacturer's handbook. After 48 hr, the HAM-transfected 
COS-7 cells were extracted with J % Triton-XlOO in 20 mM Tris-HCl pH 7.4, 

The paragraph beginning on page 59 line 19 is re-written as follows: 

Amplification of an SNC19 CD NA from T-47D breast cancer ce)l«: An SNC19 
cDNA clone was generated by reverse transcriptase^ I ymerase chain reaction (RT-PCR) 
utilizing mRNA from T-47D human breast cancer cells. Primer sequences for SNC19 
(5 1 -CCTCCTCTTGGTCTTGCTGGGG-3* £SEQjD_£iQL3Q} and 5'-AGACCCGTCTGTTTT 

JO 
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CCAGG-3') L SBQJD NO: 11) were derived from the published sequence. Standard RT-PCR 
was conducted using the Advantage RT-PCR kit (Clontech; Palo Alto, CA). Products were 
analyzed on a 0.8% agarose gel and the result band of approximately 2.8 kb corresponding 
to the expected product sue was excised from the gel, purified and ligated into pCR2.1 
(Invitrogen, Carlsbad, CA) by TA cloning ( P CR-SNC19). 

The paragraph beginning on page 62 line 22 is re-written as follows: 

Although sequence analysis of the 40-kDa binding protein has shown it to be a serine 
protease inhibitor (see below), some residual gelatinolytic activity was observed for the 95- 
kDa matriptase/inhibitor complex (Fig. 8 Q. When matriptase and its binding protein were 
subjected to N-terminal sequencing, only 1 1 amino acid residues (WGGTDADEGE) fSEO 
^ NQ: 32) from matriptase were obtained with relatively low recovery^ and 12 amino acid 
residues (GPPPAPPGLPAG) (SEQ^NO: 2) were obtained from the amino-terminus of the 
40-kDa binding protein frw e been obtained . The 1 1 amino acid residues from matriptase 
were identical to a deduced amino acid sequence from a 2.9 kb cDNA called SNC19 
(accession number TJ20428). Numerous stop codons were observed in this deposited SNC19 
sequence, resulting in several small, predicted translation products. Thus, a 2,830 bp cDNA 
fragment was obtained by reverse transcriptase-polymerase chain reaction using two primers 
based on the sequence of SNC 19. There was extensive discrepancy (132 bases) between our 
sequence and that of SNC 19. 

The paragraph beginning on page 63 line 19 is re-written as follows: 

Nucleotide and predicted amino acid s equences of an matriptase cDNA clonp - A 
nucleotide (SEQ ID NO: 4) and an amino acid s^enees- sequence fSEQ ID NO: [[3]j 
5) of matriptase are shown in Fig. 9. The primers (20 bases at 5' end and 18 bases at 3' end) 
used for reverse transcriptase-polymerase chain reaction are underlined. Thirty three bases 
beyond the 51 end primer and 92 bases beyond 31 end primer were taken from SNC 19 
cDNA and incorporated. The cDNA sequence was translated from the fifth ATG (Met) 
codon in the open reading frame. NucJeotide and amino acjd numbers are shown on the left. 
Donble-underlines indicate sequences that agreed with the internal sequences obtained from 
matriptase. His-484, Asp-539 and Ser-633 were boxed and indicated che putative catalytic 
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tZuou and MWw, SStoT 

^^^^ 

triad ofmatriptase. Po^N^cos^^a.c.^^byA. A RGD sequence is 
indicated by A. 

~ M i;«r 1 1 is re-written as follows: 
The paragraph beginning on page 64 line 2.1 is re 

Matriptase cDNA is likely to be 2955 base pair long when the 5' end 33 bases and the 

. - «=«; mf .d to the fifth methionine codon because the 
The translation initiation s.te was asstgncd to the filth 

OTCATGG matches a favorable Ko*ak consensus sequence (Kozak et aL. w 

pair, long, and thus me deduced a^ino acid sequence was composed of 683 rescues, wt 
Lu>atedmolecolarmassof75,626. The two stretches of amino acid sequences 
(DYVEINGEK and WGGTDADEGE) iSiam^ «*— ft « 

(DY , ' „, S~ 6 ^ 443-153; thus the translation frame is hkely to be 

matriptase are located in aa 228-236 and aa fS e r /Thr1 
co, J. There are three potential N-glycosylatio„ sites with the canomca! Asn-X-CSer/Thr) 

f00 „d to media, interactions with integnns C^oslahti e< «U Scene* 238: 491-7 (,987)). 
The paragraph beginning on page 64 line 2 1 is re-written as follows: 
MI£iiD!3 ^^ A homology search for the deduced 

"at^ 

other serine proteases: (2) matriptase contains the invariant catalyuc mad; (3) matnp.ase 
chain reaction are underlined. Thirty-three bases beyond the 5' end pnmer and 92 bases 

;;: n r • en d ^ ^ *~ «*> ^ and ^ ^ ^ 

rinslated rom the HfthATGcodon in the open readin g frame. Kucleottde and am.no 
« — on the left. Double-underlines indicate sequences that agreed wtth 
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the internal sequences obtained from matriptase. His-484. Asp-539. and Ser-633 were boxed 
and mdicated the putative catalytic triad of matriptase. Potential N-glycosylation sites , 
indicated by A. A RGD sequence is indicated by 4>. 



> are 



The paragraph beginning on page 66 line 3 is re-written as follows: 

Referring more specifically to Figure 10, the C-terminal region (aa431-683) of 
matriptase (SBQ g) N r> .S) is compared with human trypsin fSEOIDNf>») human 
chymotrypsinlSfiarDNOMi. the catalytic chains of human enteropeptidase (SEP ID Nfv 
21, human hepsin (SEQIDNO- 10) , human blood coagulation factor XI (SEP ID NO- ^ } } 
and human plasminogen ISE PIPW: 12) . and the serine pro .ease domains of two 
transmembrane serine proteases, human TMPRSS2 ££EOJDNOj_8) and Drosophil, Stubble- 
stubbloidgeneCSb-sb^ISEP^HQUl. Residues are expressed in one letter code Gaps 
to maxtmize homologies are indicated by residues in the catalytic triads (matriptase His-f84 
Asp-539, and Ser-633) were boxed and indicated by ♦. The conserved activation motif 
(R/KVIGG) . SEPB?NO : , 3 4) was ^ ^ protco]ytic ^ ^ ^ ^ 

Eight conserved cysteines needed to form four intramolecular disulfide bonds are boxed and 
the hkely pairings are as follows: Cys-469-Cys-485. Cys-604-Cys-618, Cys-629-Cys-658 
and Cys-432-Cys.5S9. The disulfide bond (Cys*432-Cys-5 5 9) is observed in two-chain ' 
senne proteases, but not in trypsin and chymotrypsin. Residues in the substrate pocket (Asp- 
627, Gly-655. and Giy-665) are boxed and indicated by It is evident that me residue 
positioned a, the bottom of substrate pocket is Asp in trypsin-like proteases, inc.uding 
mamptase, but is Ser in chymotrvpsin. 

The paragraph beginning on page 66 line 20 is re-written as follows: 

A putative proteolytic activation site (Arg-442) of matriptase in a motif of A r *-Val- 
Val-Gly-Gly (RWGG) (SE 0 IDNO-^| is similar to the characteristic RIVGG (SEPm 
WHO. motif in other serine proteases. However, the He residue is replaced by Val residue 
This replacement is uncommon, but is observed in plasminogen. As mentioned above a 
conserved intramolecular disulfide bond is found in those serine proteases that are 
synthesized as one-chain zymogens and are proteolyucally activated to become active two 
cham forms. This disulfide bond is proposed to hold together the active catalytic frag^m 
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with their non-catalytic N-terminal fragments, thus serving as protein-protein interaction 
domain. Tins conserved intramolecular disulfide bond has been also observed in matnptase 
(Cys-432-Cys-559). These sequence analyses suggest that matripu.se may be synthesized as 
a single chain zymogen and may become proteolytic^ activated to a two-chain form. If 
this is a case, the majority of matnptase in the conditioned medium of T-47D breast cancer 
cells is likely to be the zymogen; the active two-chain matriptase only represents a minor 
proportion, consistent with the purified matriptase from T^7D human breast cancer cells 
exhibit an apparent size of 80-kDa under reduced conditions. This conclusion is also 
supported by the observation that the proposed N-terrninal sequences for the catalytic chain 
of matriptase are identical to the stretch of amino acid sequences (WGGTDADEGE) £SEO. 
TP NO: 37^ , which were obtained with very low recovery when matriptase was subjected to 
N-terminal sequencing. 

The paragraph beginning on page 67 line 14 is re-written as follows: 

The substrate specificity (S,) pocket of matriptase is likely to be composed of Asp- 
627 positioned at its bottom, with Gly-655 and Gly-665 at its neck, indicating mat matnptase 
is a typical trypsin-like serine protease. Predicted preferential cleavage at amino acid 
residues with positively charged side chains was conrtrmed with various synthetic substrates 
with Aro and Lys residues as PI sites [[(Fig. 1 1)1]. Specifically, matriptase was able to 
cleave the following synthetic substrates, presented as follows, from the most rapid to the 
siowest: Boc-Gln-AlaArg-AMC, Boc-benzyl-Giu-Gly-Arg-AMC. Boc-l*u-Gly-Arg-AMC, 
Bocbenzyl-Asp-Pro-Arg-AMC, Boc-Phe-Ser-Arg-AMC, Boc-LeU-Arg-Arg-AMC, Boc-Gly- 
Lys-Arg-AMC, and Boc-Leu-Ser-Thr-Arg-AMC. [Boc = r-butyloxycarbonyl; AMC - 7- 
amino-lmethylcoumarin; IDL = low density lipoprotein]. This supports the view that 
matriptase prefers substrates with amino acid residues containing small side chains, such as 
Ala and Gly as P2 sites. These results suggest that mamptase, in analogy with trypsin, 
exhibits broad spectrum cleavage specificity. This broad spectrum cleavage activity is likely 
to be the explanation of its gelatinolytic activity. Its trypsin-like activity appears to be 
distinct from Gelatinases A and B, which may cleave gelatin at glycine residues, the most 
abundant (almost up to one third of) amino acid residues in gelatin. 
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The paragraph beginning on page 68 line 6 is re-written as follows: 

c,.t„ T . mntif. rf* ivtto region of matriptase: The non-catalytic region of 

matript.se contains two sets of repeating sequences, which may serve as a regulatory and/or 
binding domain for interaction with other proteins. Four tandem repeats of about 35 armno 
acids including 6 conserved cysteine residues (Fig. [(12 A)] UA) were found at the am.no 
terminal region (aa 280-430) of its serine protease domain. They are homologous with the 
cysteine-containing repeat of the LDL receptor (Sudhof et al, Science 228: 815-22 (1985)) 
and related proteins (Herz el al.EMBO J. 7:41 19-27 (1988))- A.1 of these cysteine residues 
are likely be involved in disulfide bonds. In LDL receptor, the homologous, seven repeating 
sequences serve as the ligand binding domain. By analogy, the four tandem cysteine- 
containing repeats may also be the sites of interaction with other macromolecules. In 
addition, the cysteine-containing LDL receptor domain was found in other proteases, such as 
enteropeptidase (Matsushima etaL. J. Biol. Cbem. 269: 19976-82 (1994); and Kitamoto et 
al, Proc. Nad. Acad. Sci. USA 91: 7588-92 (1994)). 

The paragraph beginning on page 68 line 21 is re-written as follows: 

Referring to Figure [[12A]] 1_LA, the cysteine-rich repeats of matriptase (aa 280-314, 
aa 315-351, aa 352-387. and aa 394-130) (SEP IP NO: 15) are compared with the consensus 
sequences of the human LDL receptor CSF.O ID NO: 16); LDL receptor-related protein (LRP) 
, SF niDNO: 171 : human perlecan fSPnmNO: 18^: and rat GP-300 (SF-OTDNO; 191 The 
consensus sequences are boxed. In Figure [[12BJ] 11B, Clr/s type sequences of matriptase 
(aa 42-155 and aa 168-268) f^rt m MPS: 20and2l) are compared with selected domains 
of human complement subcomponent Clr (aa 193-298) f SF.O TD NQ; 22) , Cls (aa 175-283) 
rSFmriNO: 231 Ra-reactive factor (RaRF) (aa 185-290) (SEP ID NO: 24) , and a calcium 
dependent serine protease (CSP) (aa 181-289) ( SF.Q ID NO: 25). The most consensus 
sequences are boxed. 

The paragraph beginning on page 69 line 4 is re-written as follows: 

The amino-terminal region of matriptase (aa 42-268) (SFO TD NQS; £0 and 21) 
contains another two tandem segments with internal homology. THese segments resemble 
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partia. sequences, original* identified in cogent subcomponents C lr O^ytus ^ 
LdJL, 2* 4355-63 0 986); and Joumet ^^^..240:783-7( 986))^ 

85 16-24 (1987,). This CWs domain was also found in other serine proteases. tnc.udmg Ra- 
reactive facto, a C4/C2-activatin g component, enteropeptidase, an activator of trypsmogen 
(Maxima - aU (1994); Kitamoto e, aU (1994)), a,d a calcium-dependent senne protease 
L is able to degrade extracellular matrix. These Clr/s-containing serirte proves appear to 
* involved either in a protease activation cascade or in extracellular matrix degradation. * 
ad diUon,*ere are at least six m^ 

«Ud> were found to contain this Clr/s domain. These inc.ude bone morphogeny protem-l 
(Wozney « aL. Science 242: 1528-34 (1988)). and Drosophila tolloui gene, a dorsal-ventral 
patterning protein (Shimel. m aL. Cell 67: 469-81 (199!)). quail ., 25^^^- 
ind uced astacin like metaUopeptidase that may play a role in the degradation of eggshell 
rnatnx, s ea urchin b.astula proteased (that could be involved in the differentiate of 
ectodermal lineages and subsequent patterning of the embryo), Xenopus embryonic protem 
UVS 2, a marker for developmental stage, and sea urchin VEB gene that is expressed * a 
spatially restricted pattern during the very early b.astula stage of development. The majonty 
of these Clr/s-containing, astacin mctalloproteases appear to play a role in protein-protem 
interactions and embryonic development. The CI r/s domain has been also found m 
nonprotease proteins. These include neuropil (A5 protein), a calcium-independent cell 
adhesion molecule that is developmentally-expressed in the nervous system and tumor 

necrosis factor-inducible protein TSG-6, a hyaluronate-binding protein that may be involved 
in cell-cell and cell-matrix interaction during inflammation and tumorigenests. 
The paragraph beginning on page 78 line 12 is re-written as follows: 
To determine the position of the cleavage site for the generation of the two-chain form 
of matriptase, the 45- and 25-kDa components were each subjected to [[N-terininal]] Hz 
terminal amino acid sequence analyses. The amino acid residues obtained from the 25-kDa B 
chain were VVGGTDADEGEWP fPPQ IP NO: 37). This sequence begins wrth the l.lcely 
cleavage site within the activation motif in matriptase. When the 45-kDa A chain (includmg 
W o major plus one minor spots) was sequenced, two overlapping sequences (SFVVTSVVAF 
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iyrr><;i fTVDPT fSFQ TP NO: 381 : TVQRXQDNSCSFGLHARGVE £ SEQ ID NO; 3g » were 
obtained, and both matched sequences close to the amino terminus of matriptase. These two 
different amino -terminal sequences may be derived from the two major spots of matriptase A 
chain and suggest that the different migration rates of the two components result from their 
different amino termini. 
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